Clustering in complex networks. II. Percolation properties 
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The percolation properties of clustered networks are analyzed in detail. In the case of weak 
clustering, we present an analytical approach that allows to find the critical threshold and the 
size of the giant component. Numerical simulations confirm the accuracy of our results. In more 
general terms, we show that weak clustering hinders the onset of the giant component whereas 
strong clustering favors its appearance. This is a direct consequence of the differences in the fc-core 
structure of the networks, which are found to be totally different depending on the level of clustering. 
An empirical analysis of a real social network confirms our predictions. 
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I. INTRODUCTION 



The general framework introduced in the preceding pa- 
per pj to analyze clustering in complex networks provide 
us with the necessary tools to tackle the study of the 
percolation properties of clustered networks. The intro- 
duction of clustering in percolation analysis represents a 
theoretical challenge due to the fact that previous analyt- 
ical approaches, applied to random two-point correlated 
directed and undirected networks 0,|3|,0>1SIH0>I1jEI0]j 
are based on the idea of branching process. One starts 
from a given vertex and follows all the edges attached 
to it. Then, the process is repeated again starting from 
the neighboring vertices avoiding the source one. All the 
vertices reachable from a given one belong to the same 
connected component and the network is said to be in 
the percolated phase when the largest component reaches 
proportions of the order of the network size, becoming 
the so-called giant connected component (GCC). This 
scheme works well when the network is locally tree-like 
and, thus, the clustering coefficient is very small. Real 
networks, however, are shown to have a significant level 
of clustering -even for very large networks- that may 
change the percolation properties significantly. 

Aside from the theoretical importance of percolation 
itself, there is a significant practical interest. Percolation 
is strongly related to epidemic processes. In fact, the 
simplest model for epidemic spreading, the Susceptible- 
Infected- Removed model ^1 0] , can easily be mapped 
into a bond percolation problem 0, 0, 0, . In its 
simplest formulation, an infected individual becomes in- 
fected for a random time t r following a Poisson process, 



that is, ip r (t r ) = Se 



-St r 



On the other hand, when an 



infected individual is in contact with a susceptible one, 
it takes a random time ti to infect it, this process being 
a Poisson process as well, that is, <f>i(U) = \e~ xti . The 
probability that an infected individual infects a suscep- 



tible neighbor before it becomes removed is then 
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where & r (t) = L Vv(t)cZt is the probability that the 
vertex remains infected for a time larger than t. Since 
the infection uses the network as a template to spread, 
the infection process can be understood as a bond per- 
colation problem over the original network when each 
edge is removed with probability qi n f = 1 — pi n f . The 
percolation threshold corresponds in this mapping to the 
onset of pandemic infections whereas the size of the giant 
connected cluster corresponds to the number of infected 
individuals. 

In this second paper, we present analytical results 
for percolation in random networks with weak transi- 
tivity, that is, with degree-dependent clustering coeffi- 
cient, c(fc), below (k — 1) , which extends and com- 
pletes material previously published in [l7|. In the case 
of percolation in the presence of strong transitivity -with 
c(k) > (k — in a given domain-, we present here new 
counterintuitive results which demonstrate that, in the 
percolation process, strong clustering favors the onset of 
the giant component whereas weak clustering hinders it. 
Furthermore, we show how these outcomes explain previ- 
ous results by other authors ■ We also discuss that 
these properties are intimately related to the structural 
organization of networks, in particular to their k-core de- 
composition 0, 0| ■ We end this paper by taking a look 
at the Pretty-Good-Privacy (PGP) web of trust 22], a 
large social network which turns out to be a nice example 
of a real system where our predictions apply. 



II. PERCOLATION IN WEAKLY CLUSTERED 
NETWORKS 

The standard percolation formalism based on branch- 
ing processes overcounts the size of a node's second neigh- 
borhood when clustering is present. To correct for this 
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effect, the usual procedure can be modified in the follow- 
ing way. One starts from a given vertex and follows all its 
edges. However, once placed in one of the neighbors, the 
next edges to follow are those not pointing to the neigh- 
borhood of the source vertex [2^ (the edges pointing to 
the neighborhood of the source vertex are the ones re- 
sponsible for clustering). It is worth to notice that, even 
in this scheme, we are neglecting the fact that higher 
order loops may be present in the network. In partic- 
ular, squares will connect vertices in the first neighbor- 
hood with a common vertex in the second neighborhood, 
over-counting it. Besides, when the multiplicity of the 
edges is large -by multiplicity, m, we mean the number 
of triangles passing through an edge-, squares induced 
by the merge of two triangles that share an edge appear. 
Therefore, the implementation of the strategy outlined 
above will become exact in the case of disjoint triangles 
but it will represent an upper bound to the real size of 
the giant component when triangles jam into edges. As 
explained in the preceding paper [lj, weakly clustered 
networks are those which have a degree-dependent clus- 
tering coefficient c(k) < (k — 1) ,Vfc, implying that edge 
multiplicity is small and triangles disjoint. In contrast, 
strongly clustered networks have high edge multiplicity 



where P{k'\k) is the probability that a vertex of degree 
k is connected to a vertex of degree fc'|29j and where 
we have defined </>(m|fcfc') as the probability that an edge 
connecting two vertices of degrees k and k' has multiplic- 
ity m. The multiplicity matrix rrikk' can be computed as 
the first moment of (f>(rn\kk'), i. e., 

m tk' 

m k k> = E m(f)(m\kk'), (4) 

m— 

where m^r — min(k, k') — 1. For randomly assem- 
bled networks, we can make use of the probabilistic in- 
terpretation of the edge clustering coefficient c(fc, k') — 
irikk' I m kk< 0> 01 an d write that 

ct>{m\kk') = ( m w \ [c(k,k')] m [l-c(k,k')} m ^'- m . (5) 

In principle, this particular form of the distribution of m 
is to be taken as an approximation. However, as we will 
see, in the case of randomly assembled networks it works 
extremely well. 

We define the generating function of g(s\k) as 

5(z|fc)^z s . 9 ( S |fc). (6) 

s 



and, thus, triangles are forced to share edges. 

With this in mind, let us start by defining the proba- 
bility that a given vertex has s reachable vertices, G(s). 
For very heterogeneous networks this function is not very 
representative and we have to define the same probabil- 
ity but conditioned to the degree of the source vertex, 
G(s\k). These two functions are trivially related through 



G(s)=J2P(k)G(s\k) (2) 

fc 

where P(k) is the degree distribution. Finally, we need to 
introduce an extra function, g(s\k), which measures the 
probability that a vertex can reach s other vertices given 
that it is connected to a vertex v of degree fc, and that 
it cannot visit either v nor its neighborhood. This last 
condition guaranties that we do not over-count contribu- 
tions due to triangles. We can find a recursion relation 
for this function taking into account that now the process 
is a branching one with the constraint that at each gen- 
eration point we can only use the free edges to continue 
the process. Then, function g(s\k) satisfies 



(3) 

I 

Using this transformation, Eq. J3J) reads 
g(z\k) = *E E P(k'\k)4>(m\kk') [g{z\k')f '" m " 1 , (7) 

k' m 

which is a closed set of equations for the functions g's. 
Finally, G(s\k) and g(s\k) are related through 

G(z\k) = z[g(z\k)] k . (8) 

The percolation transition takes place when Eq. Q, eval- 
uated at z = 1, admits as a stable solution g(z — l\k) — 
£(fc) < 1, that is, there is a finite probability (1 — £(fc)) 
that the branching process extends up to infinity, mean- 
ing that a giant connected component has been formed. 
Since g(z = l\k) = 1 is always a fixed point of Eq. J7J, 
the onset of the giant component is the point at which 
this solution becomes unstable. To perform the stability 
analysis, we linearize Eq. (JJJ by plugging in a solution of 
the form g(z = l\k) w 1 +x(fc)e. In the limit e — > 0, this 
operation yields 

x (k) = - 1 - m kkl )P{k'\k) X {k') (9) 

k' 

The critical percolation point is ruled by the maximum 
eigenvalue, A m , of the matrix (fc' — 1 — mkk')P(k'\k). 



J 



9(s\k) -EE P(k'\k)cf>(m\kk') £ g( Sl \k')g(s 2 \k') ■ ■ ■ g(s k ,- m -i\k')6 s , 1+Sl+a2+ ... Sk ,_ 

k' m si,S2,'" 



3 



• 


<k> 


=2.0 


♦ 


<k> 


=1.9 


■ 


</<> 


=1.8 


A 


<k> 


=1.7 




<k> 


=1.6 




FIG. 1: Values of the parameter P(l, 1) as a function of Co 
obtained from numerical simulations as compared to the an- 
alytical solution given by Eq. H17II . for different values of the 
average degree and an exponential degree distribution. Sym- 
bols are numerical simulations using the algorithm of Ref. 
and solid lines correspond to the analytical solution. 




FIG. 2: Relative size of the giant component in the case of 
c(k) = co/(k — 1) as a function of Co for different values of the 
average degree. Symbols are numerical simulations using the 
algorithm of Ref. [25| for an exponential degree distribution. 
Solid lines correspond to the numerical solution of the set of 
equations @. 



co('Tio) is a certain function of mo to be determined. The 
second simplification assumes the absence of two-vertex 
correlations but, when clustering is present, uncorrela- 
tion can be attained for all degree classes except for the 
degree class k — 1 (see discussion in the preceding pa- 
per Q). The reason is that vertices of degree k = 1 
cannot participate in triangles and, therefore, connec- 
tions involving these vertices are forced to follow a differ- 
ent pattern. Consequently, assuming the factorization of 
P(k,k') for k, k' > 1, the transition probability P(k'\k) 
takes the form 



P(k'\k) = 



(1 

"PUT 



p(i) 

<fc) 



(k) 



k A k' > 1 



(k) k'P(k') 



CI p W\ 



(k) 



P(i) 



(k) 



k = 1 A k' > 1 
k > 1 A k' = 1 

k = k' = 1 

where P(l, 1) is the probability that a randomly cho- 
sen edge connects two nodes of degree 1. If clustering 
is absent, then mo = and P(l, 1) can factorize as 
P(l, 1) = P 2 (l)/(k) 2 . Otherwise, if clustering is finite, 
we can assume that all edges that are not involved into 
triangles are realized in the most random form, which 
leads to the following expression 



p 2 (i) 



l 



(k) 2 l-E c /E' 



(13) 



where E c is the number of edges participating in triangles 
and E is the total number of edges. E c can be computed 
as 



E c = \ ^a lJ <d{m lJ - 1), 



(14) 



When A m > 1, the solution g(z — l\k) = 1 becomes 
unstable leading the network to the percolated phase, 
in which a macroscopic fraction of the system becomes 
globally connected. The relative size of the GCC can 
then be computed as 



gcc=l-{Uk)\ k )- 



(10) 



To be able to derive an analytic expression for the per- 
colation condition, we have to assume some simplifica- 
tions at this point. On one hand, we take rrikk' = tio, 
with uiq a constant within the interval [0, 1], that is, we 
restrict to networks with weak transitivity. In this situ- 
ation, making use of the closure condition derived in the 
preceding paper, 



Y j m k k>P{k,k')=k{k-l)c{k) 



P(k) 
(k) ' 



(11) 



we can see that the degree-dependent clustering coef- 
ficient takes the form c(fc) = co(mo)(k — 1) , where 



where Q(rriij — 1) is the Heaviside step function. This 
equation can now be rewritten as 



k,k>=2 



P{k,k')(e(m-l)\k,k') 



(15) 



where the last average is taken over the set of edges con- 
necting nodes of degrees k and k' , that is, 

(6(m - l)|fc, k') = ^2 4>{m\kk') = 1 - <j>(0\kk r ), (16) 



which can be computed using Eq. . Using this expres- 
sion, combined with the two-point correlation function, 
Eq. (|12fl and Eq. JT3J, we can write the following equa- 
tion for -P(l, 1) = 



W , 2P(1) 



i - (d>y (k) 



p 2 (\) 



(kni-m 



0, (17) 



where (4)' is the average of 4>(0\kk') over the set of ver- 
tices of degrees larger than 1. P(l, 1) corresponds to the 
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smallest solution of this quadratic equation. Finally, us- 
ing Eq. I|llf) . the clustering factor co(toq) can be written 



co (mo J 



rn - 



^(1,1) 



(1 



-jky> 



(18) 



Using the results above, the maximum eigenvalue of the 
matrix (As' — 1 — rrikk')P(k'\k) can be analytically com- 
puted, yielding a percolation condition given by 



(fc(fc-l)) 
(As) 



> (l + c (m )) 



mo 



co (to ) 



(1 



P(l) 

(As) 



)• (19) 



It is easy to prove that the right hand side of this equation 
is always larger or equal to 1. This means that weakly 
clustered networks percolate at a higher density of con- 
nections as compared to the unclustered ones. For very 
low clustering, this term converges to 1 and so we re- 
cover the percolation threshold of random networks with 
a given degree distribution. 

There is a particular case in which the symbiosis of a 
specific form of the degree distribution and a weak clus- 
tering in the frontier, c(k) = (k — 1) , maintains the 
critical point unchanged with respect to the unclustered 
classical random graph. This model is studied in detail in 
Ref. 0| as a natural extension of the Erdos-Rcnyi ran- 
dom graph [26l |27| , where each possible triangle among 
a fixed number of vertices is realized with a given prob- 
ability. In the thermodynamic limit, and for sparse net- 
works, all edges in the ensemble have fixed multiplicity, 
to = 1, and the generating function of the resulting de- 
gree distribution is P(z) — exp((As)(z 2 — l)/2). Since 
odd degrees are not present, P(l) = P(l, 1) = and the 
critical condition Eq. (|19J) becomes (k(k — l))/(k) > 2 
which, for this particular degree distribution translates 
into (As) > 1, that is, the same percolation condition that 
applies for the classical random graph. 

To check the accuracy of the formalism, we gener- 
ate clustered random networks with the algorithm in- 
troduced in [2]| with an exponential degree distribution 
and a degree-dependent clustering coefficient of the form 
c(k) — co(k — 1) _1 [30(. Notice that, when generating the 
networks, we fix the value of Co and not Too, as it is done 
in the derivation above. However, we can make use of 
Eq. Ijl8(l to go from one parameter to the other. Fig. 
shows a perfect agreement between the empirical values 
for the parameter P(l, 1) as a function of Co for different 
values of the average degree and the analytical results 
derived from Eq. i|17|) . In Fig. [3 we compare the relative 
size of the giant connected component as a function of 
Co with the numerical solution of Eqs. J7J) combined with 
the transition probability given in Eq. I|12|) . As it can be 
seen in the figure, the effect of clustering is to reduce the 
size of the giant connected component. The effect is so 
strong that, in networks with a moderate average degree, 
it completely fragments the network when Co exceeds a 
critical value. In other cases, the reduction of the size can 
be of more of the fifty percent. For values of c £ [0, 0.5], 




FIG. 3: Top graph: Bond percolation on networks with ex- 
ponential degree distribution ({k) = 2) and weak clustering 
(black filled circles), strong clustering (red filled squares), and 
zero clustering (blue filled triangles). For both types of clus- 
tering the size of the giant component of the original networks 
is smaller than the unclustered one. However, the percolation 
threshold for the network with weak clustering is smaller than 
the unclustered net whereas it is larger for the network with 
strong clustering, despite the fact that both networks have 
the same value of c. The inset shows the degree-dependent 
clustering coefficient for both networks. The area depicted 
in grey indicates the limits of weak clustering. Outside this 
region, the multiplicity of edges is necessarily larger than 1. 
Bond percolation simulations are performed over a single net- 
work of size N = 10 s and, then, averaged over 50 different 
realizations for each value of q. Bottom graph: Percolation 
threshold for strong and weak transitivity as a function of 
the global clustering C = (1 — P(l))~ 1 c. The blue dashed 
line is the percolation threshold for the unclustered network 
(<fc = 1/2). 



the agreement between our theory and numerical simula- 
tions is excellent. Beyond this point, our approximation 
slightly overestimates the GCC's size. This is mainly due 
to the fact that in this regime, links of multiplicity larger 
than 1 appear which, in turn, induces the presence of 
some loops of order four. 



III. PERCOLATION IN STRONGLY 
CLUSTERED NETWORKS 

In the strong transitivity regime, the solutions ob- 
tained from our theory become upper bounds to the real 
size of the giant component due to the fact that then 
higher order loops appear. Unfortunately, it is not easy 
to extend the analytical calculations to this case but one 
can resort to numerical simulations. Fortunately, we can 
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make use of the algorithm presented in Ref. [2q. which 
allows to generate random networks with a given degree 
distribution, a fixed degree-dependent clustering coeffi- 
cient, and at the same time, to exert some control on the 
assortativity level of the network. 

First, it is important to comment on the results by 
Newman who solved exactly the bond percolation 
problem for the one-mode projection of random bipar- 
tite graphs. One of the main results in that study is that 
clustering, although makes the giant component smaller, 
favors its onset, which seems to be just the opposite result 
to the one that we obtained in the case of weak transi- 
tivity networks. To solve this puzzle, we first need to 
understand how one-mode projection networks are con- 
structed. In a bipartite network, two types of vertices 
coexist, for instance scientists and scientific papers, with 
connections among them. The one-mode projection is 
then constructed by retaining just scientists and connect- 
ing them whenever they coauthor the same paper. That 
means that all papers with more than two authors will 
give place to cliques of connected scientists. Therefore, 
edges participating in triangles will have high multiplicity 
and the networks so generated will belong to the strong 
transitivity class. We shall see in the following that it 
is precisely the class the network belongs to -and not 
the scalar clustering coefficient- that determines whether 
clustering favors or not the onset of the giant component. 

To check this issue, we have generated two net- 
works with identical degree distribution (exponential 
with (k) — 2) and different forms for c(k), the first one 
with weak transitivity and the second one with strong 
transitivity. Then, we study their percolation properties 
by implementing a bond percolation experiment. We re- 
move each edge with probability q and measure the size 
of the giant component. Fig. [3] shows the results of this 
program. As it is clearly seen, in both types of networks 
the size of the giant component is smaller than that of the 
unclustered network. However, the percolation threshold 
for the weak transitivity class is smaller than the unclus- 
tered value, whereas it is larger in the strong transitivity 
class, despite the fact that both networks have the same 
value for the scalar clustering coefficient c. In Fig. |3 
we also show the behavior of the percolation threshold 
q c for different values of the scalar clustering coefficient 
C = c/(l - P(l)) (which is defined in the interval [0, 1]) 
for the strong and weak transitivity cases, which confirms 
this trend. 

Although illuminating, this analysis has the problem 
that we are comparing the resilience properties of net- 
works with different giant components. Despite that 
the degree distribution is the same for all the networks, 
it could not be the case for their giant components, 
potentially changing then the location of the critical 
point. To overcome this problem, we generate a network 
with strong transitivity and extract its giant component. 
Then, using the degree distribution of this giant compo- 
nent we generate two more networks, one without cluster- 
ing and the other one with weak clustering. Finally, we 




q 



FIG. 4: Results of the bond percolation process on networks 
with a degree distribution of the form P(k) ~ k 7 = 3.5, 
and strong clustering (red filled squares), weak clustering 
(black filled circles), and unclustered (blue filled triangles). 
In all cases, the original networks are single connected com- 
ponents. 




FIG. 5: Relative sizes of the giant fc-cores of networks with 
P(k) ~ k 1 and 7 = 3.5 for weak clustering, strong cluster- 
ing, and unclustered, respectively. 



check that these two networks are globally connected. We 
choose as the degree distribution to generate the strongly 
clustered network a function of the form P(k) ~ k~~ 7 , 
with 7 = 3.5. This particular form is convenient because 
using this distribution the range of degrees is relatively 
wide and, at the same time, the network still has a finite 
percolation threshold. We also restrict the degrees to 
values larger than two. Otherwise the unclustered net- 
work would not have a GCC. Fig. 0] shows the results 
for the bond percolation process in these three networks. 
As it is clearly seen, the size of the GCC of the highly 
clustered network decreases very fast when a small frac- 
tion of edges is removed whereas the unclustered one is 
more resilient in this regime. However, as q increases, the 
unclustered network undergoes the phase transition (at 
q c = 0.4) whereas the highly clustered net shows a sur- 
prising resilience, decaying very slowly as q increases. On 
the other hand, the weakly clustered network is always 
less resilient than the unclustered one and undergoes the 
phase transition at a lower level of damage. These results 
confirm those shown in Fig. [21 

To understand which is the origin of this behavior and 
the differences between weak and strong clustering, we 



need to understand the kind of structures that are formed 
depending on the clustering properties. The concept of 
fc-core is particularly suitable for this purpose (see [2l| 
for a very nice work on fc-core percolation in random 
networks). The fc-core of a network is the maximal sub- 
graph such that all its vertices have fc or more connec- 
tions within the subgraph. Therefore, fc-cores are sub- 
graphs which are particularly resilient to the removal of 
edges if fc is large. In Fig. we show the relative sizes 
of the giant fc-cores of the networks of Fig. 01 as a func- 
tion of k compared with the cumulative degree distribu- 
tion P c (k) — ^2 k / >k P{k'). This comparison is in order 
because P c {k) is the maximum possible value the giant 
fc-core can attain. In the cases of weak clustering and 
unclustered network, the entire network forms a giant 2- 
core but the fc-cores for fc > 2 do not exist. This result 
can be understood using results from [2l| that state that, 
in random networks, the giant fc-core undergoes a fc-core 
percolation transition which is discontinuous, similar to 
what happens in first order phase transitions. In con- 
trast, the strong clustered network has fc-cores for k > 2 
which do not vanish, with a constant fraction of P c (k) 
belonging to them. This behavior extends up to degree 
k = 7, and the fc-core finally disappears at fc = 8. This 
result explains the strong resilience of the strongly clus- 
tered network since, although small, these fc-cores with 
high fc are extremely difficult to break. This suggests that 
the properties of the critical percolation threshold in this 
class of networks should be tied to their fc-core perco- 
lation properties. This is an issue that deserves further 
consideration and will be addressed in a future work. 



IV. PERCOLATION IN SCALE-FREE 
CLUSTERED NETWORKS 

Random scale-free (SF) networks with 2 < 7 < 3 have 
the peculiarity of not having a percolation threshold or, 
equivalently, q c = 1. This means that one has to remove 
the totality of the edges to break the network into dis- 
connected components. In epidemiological language, it 
means that any disease, even a low infectious one, can 
propagate and infect macroscopic portions of the popu- 
lation. Given the important implications of this result, 
it is necessary to discern whether it can be applied to 
more general types of networks. For instance, it has been 
proved that two-vertices degree-degree correlations can- 
not restore a finite epidemic threshold 0,[2^|. However, 
this result cannot be applied to networks with high levels 
of clustering since the networks used in the demonstra- 
tion had a vanishing clustering coefficient in the thermo- 
dynamic limit. 

As we have already seen in previous sections, weak 
transitivity hinders the onset of the giant component, 
with a condition for its existence, in the uncorrelated 
case, given by 




(fc(fc-l)) 
(fc) 



> (1 + c (m )) 



m 



c (m ) 



(1 



(fc) 



)• (20) 



FIG. 6: Top: results of the bond percolation process on a 
network with a degree distribution of the form P(k) ~ fc~ 7 
and 7 = 2.5 and its randomized version. The inset shows a 
detail close to q = 1. Bottom: relative sizes of the giant fc- 
cores of these two networks. The solid line is the cumulative 
degree distribution P c (k). 



In the case of SF networks, the term on the left in this 
inequality diverges and, therefore, the condition is always 
fulfilled. In short, this means that weak transitivity can- 
not restore a finite percolation threshold. 

When the network belongs to the strong transitivity 
class, its percolation properties are not much different 
from networks with bounded fluctuations, except for the 
fact that the critical threshold is located at q c = 1. In 
Fig- El we repeat the same analysis performed in the pre- 
vious section but now for a SF network. We first generate 
a highly clustered SF network with 7 = 2.5 and extract 
its giant component. Then, we randomize it to obtain an 
unclustered random network preserving the degree dis- 
tribution. Finally, a bond percolation process is applied 
to both networks. As in the case of 7 = 3.5, the clus- 
tered network is less resilient than the unclustered one 
except for high levels of damage, where the clustered net 
is more resilient. In this case, the fact that we find a 
finite threshold is due to finite size effects. Again, the fc- 
core analysis reveals a nested structure of fc-cores up to 
fc = 65 following closely the shape of P c (k). In contrast, 
the fc-cores for the unclustered network decays very fast 
as fc increases. This result implies that strong transitivity 
cannot restore a finite percolation threshold either. 



V. PERCOLATION IN REAL NETWORKS: THE 
CASE OF THE PGP NETWORK 

We cannot end this paper without taking a look at 
the real world and checking if our results applies also 
there. To this end, we chose the P retty - Good-Privacy 
(PGP) web of trust analyzed in Ref. 22]. This is a nice 
example of a large social network based on trust. It arises 
as a consequence of the need for secure communications 
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k 

FIG. 7: Top: bond percolation experiment performed on the 
giant component of the PGP network and on its randomized 
version. The inset shows a detail close to q = 1. Bottom: 
relative sizes of the giant fc-cores for the PGP network and 
its randomized version, respectively. The solid line is the 
cumulative degree distribution P c (k). 

through the Internet. Without going into great detail, 
when a user A wants to send a message to another user 
B, she encrypts the message using the public key of user 
B who, afterwards uses her private key to decrypt it. In 
this way privacy is ensured. However, since everybody 
can generate his own pair of keys, it is not possible, in 
principle, to be sure that the person holding the key is 
who she claims to be. An imaginative solution to this 
problem is the web of trust. In this web, any user can 
sign the public key of another user, meaning that she 
trusts the other person is who she claims to be. This 
procedure generates a publicly available web of trust of 
users that have signed the public keys of other users. In 
principle, this web is directed. However, since we are 
interested in social ties, we filter out those connections 
that are not reciprocal. In this case, an edge among 
two persons is likely to represent a social relationship 
between them. After the filtering process, we obtain a 
network with N — 57243 vertices and a giant connected 
component of N gcc — 10680 vertices. The network has 
a degree distribution with a heavy tail (although not a 
pure power law) extending up to k ~ 200 and a degree- 
dependent clustering coefficient which is constant up to 
degree k = 50 followed by a sharp decay for k > 50, 
with an overall value of 0.5. This property sets the PGP 
network at the heart of the strong transitivity class. 

The bond percolation experiment performed on the gi- 
ant connected component of the PGP network reveals the 
same type of pattern that we have found before and is 
shown in Fig. \7\ Again, the randomized version is more 
resilient except for very high values of q, where the PGP 
net is more resilient, with the critical point closer to 1. 
The A;-core decomposition also shows the same type of 
result. The PGP network has a nested A:-core structure 
extending up to k = 31, whereas the randomized network 



has only fc-cores up to k — 7. 

We would like to stress that this behavior is by no 
means exclusive to the PGP network since many net- 
works in the real world belong to the strong transitiv- 
ity class. The results presented here are extremely rele- 
vant in the case of epidemic spreading. On the bad side, 
they suggest that real clustered networks are more prone 
to suffer epidemic outbreaks than unclustered networks. 
Yet, the relative size of the potentially infected popu- 
lation is smaller, which is indeed a positive result. We 
would also like point out that, the knowledge of the role 
that the giant fc-cores have on the percolation properties 
could be used to design and plan more effective immu- 
nization strategies. 

VI. CONCLUSIONS 

In this second paper, we have presented an analyti- 
cal approximation to percolation in weakly clustered net- 
works. Although this formalism is exact only in the limit 
of weak transitivity, it is an upper bound for the size of 
the GCC in all cases. Using this approach, we have seen 
that weakly clustered networks percolate at a higher den- 
sity of connections as compared to the unclustered ones. 
By means of numerical simulations, we have also proved 
that the percolation threshold for networks in the weak 
transitivity class is smaller than the corresponding to an 
unclustered network with the same degree distribution. 
In contrast, this threshold is larger for nets in the strong 
transitivity class. This means that weak clustering hin- 
ders the appearance of the giant connected component 
whereas it is favored by strong clustering. To understand 
which is the origin of this behavior and the differences be- 
tween weak and strong clustering, we have explored the 
structural organization of networks through their fc-core 
decomposition, finding important differences among the 
two classes. In the case of scale-free networks, we have 
seen that neither weak nor strong transitivity can re- 
store a finite percolation threshold. We have checked our 
results using a real social network, finding a very good 
agreement. 

To summarize, in this paper and the preceding one we 
have developed a full theoretical approach to clustering 
in complex networks. We hope that these developments 
will improve the topological characterization of complex 
networks but also that they will help to produce a bet- 
ter understanding and a more realistic modelling of the 
dynamical processes that conform or use them. 
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